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Summary 


The  statistical  properties  of  a  cubic  smoothing  spline 
and  its  derivative  are  analyzed.  It  is  shown  that  unless  un¬ 
natural  boundary  conditions  hold,  the  integrated  squared  bias 
is  dominated  by  local  effects  near  the  boundary.  Similar  effects 
are  shown  to  occur  in  the  regularized  solution  of  a  translation- 
kernel  integral  equation.  These  results  are  derived  by  developing 
a  Fourier  representation  for  a  smoothing  spline. 
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1.  Introduction  and  Summary 


We  consider  statistical  properties  of  smoothing  splines  and  related 
procedures.  Given  x^  ■  f(t^)  +  e^,  1  =  l,..,,n  where  g  is  an  unknown 
smooth  function  and  the  e ^  are  random  errors,  a  cubic  smoothing  spline 
g(t;X)  is  the  function  which  minimizes 


n 


n  _  [xrg(t ±)V 

i=!  1  1 


+  X 


J(g~< 


u»  dt 


d.i) 


Smoothing  splines  were  proposed  by  Whittaker  (1923),  Schoenberg  (1964),  and 
Reinsch  (1967),  Some  analysis  of  their  statistical  properties  in  the  case 
that  g  and  f  are  periodic  appears  in  Wahba  (1975)  and  Rice  and  Rosenblatt 
(1981),  The  method  of  cross  validation  for  choosing  the  smoothing  parameter 
X  from  the  data  has  been  discussed  in  Craven  and  Wahba  (1979), 

Smoothing  splines  may  be  viewed  in  a  larger  context.  Given  x^  = 
(Af)(t^)  +  ei  where  A  is  a  linear  operator,  a  "regularized"  estimate  of 
f  is  the  function  g  which  minimizes 


n 


~  E  [x,-(Ag)(t  )]2 
i=l 


+ 


<g~(t))2  dt  . 


(1.2) 


Frequently  Af  is  of  the  form 


(Af ) (t) 


J  k(t,s)f(s)ds  . 


(1.3) 


Many  examples  of  this  type  may  be  found  in  Tikhonov  and  Arsenin  (1977). 

The  method  of  regularization  is  used  to  control  the  instability  that  would 
arise  if  one  tried  to  invert  A  or  A*A,  The  regularized  solutions  have  a 
formal  resemblance  to  ridge-regression  estimates;  in  both  cases  the  variance 


of  the  estimate  is  reduced  at  the  cost  of  increasing  bias.  Although 
there  is  a  large  literature  on  this  topic,  there  has  been  relatively  little 
analysis  of  the  statistical  properties  of  the  solutions. 


In  this  paper  we  examine  two  cases  of  (1.3),  numerical  differentiation 


(Af ) (t) 


f (u)du 


(1.4) 


and  deconvolution. 


(Af ) (t)  = 


w(t-s)f (s)ds  . 


(1.5) 


We  next  summarize  and  discuss  our  main  results.  Derivations  and  some 
further  results  are  contained  in  later  sections.  We  first  deal  with  a 
cubic  smoothing  spline. 

Consider  observations 

Xk  =  f  (k/n)  +  ek,  k  =  0,1 . n 

2 

with  f  continuously  differentiable,  f~  €  L  and  the  random  variables 

with 

Eekio 

E  Vj  ■  °2'  °2  *  0  • 

We  wish  to  determine  a  continuously  differentiable  function  g  *  g(t;X,n) 

2 

with  €  L  that  minimizes 


3 


1 

4 


n-1 

(x  +x  -g(0)-g(l))2  +  ^ 
u  n  k=l 


(x^-g(k/n)) 2 


I 

1 


(g-xt))2  dt  . 


(1.6) 


Here  X  ®  X(n)  >  0  and  the  object  is  to  determine  X(n)  as  a  function  of 
n  so  that 


r 


Etg(t)-f(t)r 


dt 


tends  to  zero  as  n  -►  00  at  a  rapid  rate.  The  term 

|  (x0+xn)  -  \  (g(0)+g(l)) 

appears  in  (1.6)  because  one  wishes  to  allow  for  the  possibility  that 
f(0)  4  f(l)  and  in  that  case  the  Fourier  series  of  f(t)  will  converge  to 
\  (f(0)+f(l))  at  t  =  0,1. 

Theorem  1.  Let  f  €  C2.  If  X3(n)n8  X(n)  0  as  n  -*■  »  then 

,  2.-1/ 4  7/, 

o2(g(t)]dt  S  -  3*2  7/2  . 

/  3  8 

Theorem  2.  Let  f  €  (;  .  Assume  that  X  (n)n  -*■  ”,  X(n)  -*-0  as  n  -*■  <*>. 
Then  if  f(2)(0)  or  f(2)(l)  *  0 

J [E  g(t)-f(t)]2  dt^{(f(2)(0))2  +  (f(2)(l))2}  X5M  2_3/2 

while  if  f(2)(0)  «  f(2)(l)  -  0  but  f(3)(0)  *  0  or  f(3)(l)  +  0  we  have 


4 


J l e  g(t)-f(t)]2  dt  a  {(f(3)(0))2  +  (f(3)(i))2} 

.7/4  ,-3/2  . 

A  3*2 

A  common  reason  for  nonparametric  data  smoothing  is  to  calculate  an 
extimate  of  the  derivative  of  a  function.  Schemes  for  numerically  differ¬ 
entiating  noisy  data  that  are  closely  related  to  the  derivative  of  a  smooth¬ 
ing  spline  have  been  proposed  in  Cullum  (1971)  and  Anderssen  and  Bloomfield 
(1974).  The  properties  of  the  derivative  of  a  smoothing  spline  follow 
fairly  directly  from  the  properties  of  the  smoothing  spline  itself. 

Theorem  3.  If  f  €  C  and  if  An  °°  as  n  -*  00  and 
A  ->  Oj  then 

f  V<t»at  -  r3/‘  •  a-7'2  ♦  o(n_1x”372)  . 

Jo 

Theorem  4.  Assume  that  f  (  d,  and  that  Xn5  -*■  Then  if 
f(2)(0)  +  0  or  f(2)  i  0 

[E  g'(t)-f'(t)]2  dt  S  [(f(2)(0))2  +  (f(2)(l))2]  •  X3/A  •  3  •  2~3/2  : 

If  f(2)(0)  =  f(2)(l)  -  0,  but  f(3)(0)  or  f(3)(l)  i  0  then 

[E  g'(t)-f'(t)]2  dt  9?  [<f(3)(0))2  +  (f(3)(l))2]X5/A  •  3  •  2'372  . 

Comparing  these  results  to  Theorems  1  and  2  we  see  that  the  variance  and 

-1/2 

integrated  squared  bias  of  the  derivative  are  a  factor  of  A  larger  than 

the  variance  and  integrate  square  bias  of  the  function  itself. 


Theorem  2  shows  that  the  integrated  squared  bias  is  dominated  by  con- 

fk ) 

tributions  from  the  boundary  unless  g  satisfies  the  condition  g  '(0)  = 

(k) 

g  (1)  *  0,  k  ■  2,3.  Lemma  6  of  section  3  gives  a  local  approximation 

to  the  bias  in  the  case  that  these  conditions  are  not  met.  Roughly,  the 

bias  decays  like  exp  (-2  ^^t)  trigonometrically  modulated.  In  the 

2 

interior  of  [0,1]  the  squared  bias  is  proportional  to  X  # 

These  results  are  not  unexpected.  The  smoothing  spline  is  a  "natural” 

spline  and  satisfies  the  two  arbitrary  end  conditions  f~(0)  *  f~(l)  =  0. 

In  the  context  of  pure  interpolation  the  use  of  a  natural  spline  is  usually 

2 

not  recommended  since  the  error  near  the  ends  is  of  order  h  where  h 
is  the  mesh  size  whereas  other  methods  can  produce  an  error  uniformly  of 
order  h4,  if  f  €  C4,  de  Boor  (1978),  Powell  (1981).  Similarly,  it  can 
be  shown  that  the  boundary  effect  dominates  the  integrated  squared  error, 
Rosenblatt  (1976).  In  the  nonstochastic  framework  methods  of  estimating 
the  boundary  constraints  have  been  proposed  in  these  references  and  it 
would  appear  plausible  that  a  similar  approach  might  work  in  the  stochastic 
case. 

Natural  splines  in  the  nonstochastic  setting  and  smoothing  splines  in 
the  stochastic  setting  are  the  optimal  solutions  of  certain  minimax  problems, 
Powell  (1981)  and  Speckman  (1981).  It  appears  that  flexibility  is  lost  by 
guarding  against  worst  cases. 

Smoothing  splines  have  also  been  proposed  in  the  case  of  spectral 
density  estimation  (see  Cogburn  and  Davis  (1974)  and  Wahba  (1980)).  Boundary 
effects  similar  to  those  studied  here  occur  in  the  case  of  periodic  smoothing 
splines  unless  the  function  is  smoothly  periodic  (see  Rice  and  Rosenblatt  (1980)) 


6 


The  aliasing  in  the  case  of  spectral  analysis  of  discretely  sampled  data 
implies  that  boundary  behavior  will  not  be  smooth  in  this  context. 

In  the  deconvolution  problem  we  consider  observations 

=  F(k/n)  +  k  =  0,...,n 


1  2 

where  F(k/n)  =  ^  w(k/n-u) f (u)duf  with  f~  €  L  and  the  uncorrelated 

2 

random  variables  with  mean  0  and  variance  o  .  The  regularized  approxima¬ 
tion  to  f  is  the  function  g  that  minimizes 


±  (x0+xn-G(0)-G(l))2 


n-1 

+  ^  (x.-G(k/n))2 

I*  i  1 

k=l 


(g~(t))2  dt  . 


(1.7) 


Here  G(k/n)  *  w(k/n-u) f (u)du.  The  kernel  of  the  integral  equation,  w, 

is  the  periodic  extension  of  a  function  defined  on  [0,1],  and  it  is 
2 

assumed  that  w  €  L  .  We  assume  that  the  Fourier  coefficients  w.  of  w 

k 

are  nonzero  for  all  k. 


The  constants  that  occur  in  the  asymptotic  expressions  for  the  compo¬ 
nents  of  the  integrated  mean  square  error  depend  on  the  exact  form  of  w, 
but  the  rates  of  decrease  depend  only  on  the  rate  of  decrease  of  the  Fourier 
coefficients  w^  of  w.  Paralleling  Theorems  1  and  2  we  have 


Theorem  5,  Let  f  €  c  and  suppose  that  |w. 


-20 


0  >  0.  If  Xn 


20+3 


* 


4  .  \ 


7 


V 


/„2[ ecoidt-n-1  x-<26+1>/<26+4)  . 

Theorem  6.  Let  f  €  C4  and  suppose  that  |  |  ^  ~  k~28,  8  >  0  and  Xn26+3  -*■  «® 

as  n  -*■  oo.  Then  if  f~(0)  or  f~(l)  /  0 

J [E  g(t)-f(t)]2  dt  ~  X5/(26+4)  . 

If  f~<0)  =  f~(l)  =  0  but  f(3)(0)  or  f(3)(l)  +  0,  then 

j [E  g(t)-f(t)]2  dt  ~  x7/(26+4)  . 

If  f(k)(0)  =  f(k)(l),  k  =>  2,3  then 

j[ E  g(t)-f(t)]2  dt  ~  x8/(26+4)  . 

Analytic  expressions  for  the  approximate  local  bias  are  not  available, 
but  the  qualitative  behavior  is  similar  to  that  of  a  smoothing  spline. 

Note  that  if  w  is  very  smooth,  6  is  large,  and  the  integrated 


mean  square  error  will  tend  to  zero  relatively  slowly. 


8 


2.  Examples 


2 

The  function  f(t)  -  cos  (2nt)  +  4  cos  (nt)  satisfies  f~(0)  -  -8tt  f 
f~(l)  =  0,  f^(0)  =  f^il)  *  0,  Figures  1  and  2  show  the  exact  bias  of 
the  smoothing  spline  estimate  of  the  function  and  its  derivative  for  50 
equi-spaced  sampling  points  and  X  =  10  The  effect  of  f~(0)  is  clearly 
evident.  The  asymptotic  analysis  (Lemma  6)  predicts  that  the  bias. 


b(t)  S  f~(0)X1/2  exp  (-t  2_1/2x"1/4) 


•[sin  (t  2_1/2X~1/4)  -  cos  (t  2-1/2X_1/4) ]  . 


From  this  expression  we  see  that  the  first  zero-crossing  of  the  bias  should 

occur  at  t  -  2  =  ,035  and  that  b^t)  should  be  zero  at  t  = 

-ft X 1/ ^  2  =  ,070,  which  is  borne  out  in  Figure  1.  Figure  2  shows  that 

-1/4 

the  bias  of  the  derivative  is  larger  by  a  factor  of  about  X 

We  next  consider  the  deconvolution  problem  wherein  f  is  convolved 

with  a  function  w,  the  graph  of  which  is  an  isosceles  triangle  centered 

at  0  with  height  20  and  base  .4.  This  is  intended  to  correspond  to  a 

situation  in  which  averaged  values  of  f  are  measured  with  error.  Since 

the  analysis  of  section  4  requires  that  w  be  periodically  extended,  the 

triangle  is  also  centered  over  -1  and  1.  To  calculate  the  bias,  (1.7) 

was  discretized  assuming  25  equi-spaced  observations  and  the  solution  was 

computed  at  50  equi-spaced  points.  Other  mesh  sizes  were  tried  to  insure 

that  the  results  did  not  merely  reflect  the  discretization.  The  calculations 

were  done  on  a  VAX  11/80  in  double  precision.  Figure  3  shows  the  bias  for 
—8 

X  *  10  ;  there  is  a  clear  effect  near  0  and  also  an  effect  near  1.  The 

shapes  are  qualitatively  similar  to  Figure  1. 


I 


Since  the  assumption  that  w  is  periodically  extended  is  clearly 
somewhat  artificial,  we  also  computed  the  bias  for  w  just  correspond¬ 
ing  to  a  triangle  centered  over  0.  The  resulting  bias  is  shown  in  Figure  4. 
Here  the  only  effect  is  near  0;  the  effect  near  1  of  Figure  3  is  apparently 
due  to  the  periodicity  of  w. 


10 


3.  The  Smoothing  Spline  and  Its  Derivative 

In  this  section  we  derive  Theorems  1-4  and  some  auxiliary  results. 

In  order  to  do  this  we  carry  out  a  Fourier  analysis  of  the  smoothing  spline. 

Notice  that 


gk 


jf  e2lTlktg(t)dt  =  Ag°ak  -  Ag1!^  +.  \bk 


for  k  ^  0  where 


(3.1) 


=  g(i)  -  g(0) 

Ag1  =  g'(D  -  g'(0) 


-  ■  f 


27tikt  . 

e  g  (t)dt 


Let 


and  set 


yi = 


I  2  (VXn} 


If  j  =  0 

if  j  =  1» . . . >n-l 


n-1 

y  =  —  y  exp  (2irijk/n)  . 

J  Sn  j=0  2 

Given  a  sequence  of  coefficients  pk  we  will  let  denote  the  correspond¬ 

ing  set  of  aliased  coefficients  arising  in  a  discrete  Fourier  analysis 


11 


12 


„  n-1  « 

i2  x-*  \y 


iwsV^EE 

L  kfto  s 


(3.6) 


hk+Sn'  +  S  |hsn' 


In  minimizing  this  expression,  one  can  separately  minimize  the  sum  of  the 
terms  with  k  fixed  for  each  value  of  k.  Minimizing  for  k  -  0  leads  one 
to  (3.3)  and  (3.4).  For  k  ^  0  we  have 


Vsn  *  C\  ~  (W  n  )bk+sn 


(3.7) 


Multiplying  by  ^>|c^sn  anc^  summing  over  s  leads  to 


(W 


(n)  _  Zkrk 


(3.8) 


and  this  together  with  (3.7)  leads  to  (3.5), 


Lemma  2.  Insert  (3.3).,  (3.4)  and  (3.5)  in  (3.6).  Minimizing  the  resulting 
expression  with  respect  to  Ag°,  Ag1  leads  to 


Ag°  = 


n-1  *  )  ( n-1 

y*  yk  “oo  ,  A 

a***  <k>||£i 


|a‘n)|2/(X+rk)j 


(3.9) 


(3.10) 
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If  we  Insert  (3.3),  (3.4)  and  (3.5)  In  the  expression  (3.7),  the  result 
can  be  written  as 


n-1  | *  | 2 
k=l  k 


(3.11) 


Minimizing  this  expression  with  respect  to  Ag*^  and  Ag^  leads  to  the 
following  equations 


n-1  a  — (n) 

y  zkak 

ti  x+rk 


=  0 


(3.12) 


n-1  -  (n) 

48  +f;  TST- 

k=l  k 


0  . 


(3.13) 


On  solving  for  Ag^,  Ag^  the  expressions  (3.7)  and  (3.10)  are  obtained. 


Lemma  3.  The  function  g  minimizing  (1.1)  has  Fourier  coefficients 


lr(n) 


b  =  —  +  Ag  b_ 

80  ^  8  0 


(3.14) 


8sn  =  A8°asn  ‘  Aglbsn  foP  s  *  0 


(3.15) 


and  for  k  *  n-1  and  s  integral 


A  0  r  1 _  | ,  1 2  (n) 

8k+sn  =  Ag  ak+sn  '  X+T  'VsJ  ak  1 

k 


(3.16) 


-  Ag1[b 


1  ‘b _ I2  bjn)l 


k+sn  X+r.  1  k+sn 1  k 
k 


,  "w  \ 

X+tk  JS 
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with  Ag°,  Ag1  given  hy  (3.9)  and  (3.10). 

The  fact  that  =  0  and  (3.3)  holds  lead  to  (3.14).  Also  (3.4) 

and  (3.5)  inserted  in  (3.1)  yield  (3.15)  and  (3.16). 

The  integrated  mean  square  error  of  g(t)  as  a  function  of  f(t)  is 


r 


E[g(t)-f(t)r  dt 


r 


s: 


var(g(t))dt  +  I  [E  g(t)-f(t)]  dt  . 


(3.17) 


Moreover 


var(g(t) )dt  =  var(gQ) 


+  2  ^  var(g  )  . 
k=l  K 


(3.18) 


It  should  be  noted  that  the  g^'s  are  complex-valued  random  variables.  The 
covariance  of  two  complex-valued  random  variables  U,V  is  understood  to  be 

cov(U,V)  =  E{ (U-EU) (V-EV) }  . 

We  shall  now  derive  Theorem  1.  Notice  that  Ag^  and  Ag^  are  real  even 
though  they  are  written  in  complex  form.  It  is  clear  that 

cov(?ryk)  =  --L]o2  (3.19) 

for  j,k  =  0,1,..., n-1.  From  (3.19)  it  follows  that 


Since 


n-1 


dx 


16 


if  X(n)n  ■+  00  as  n  -*■  ®,  it  can  be  seen  that 


o2(Ag°)  S  —  C2C”2X"1/4  , 


a2(Ag1)  S  2-  c4C32X'3M  , 


if  X(n)n  ■+  <*>  as  n  -*•  ®.  The  term 


(3.26) 

(3.27) 


n-1 

EE 

s  k=l 


k+sn  X+r, 


lhfcf.nl 


2  (n)  |  2 


(3.28) 


2  o 

occurs  as  a  coefficient  of  a  (Ag  )  in  contributing  to  (3,18).  However, 
(3.28)  can  be  approximated  by 


X2E - X*'’  c. 


2irik  _  „  ,1/4 


k  (X  |  2irk  |  4+l)  2 


with  an  error  0 


if  X(n)n  -►  00  as  n  -*  °°.  The  term 


E  E  lbk+3n  -  lv8„l2  ^n)l2 

s  k*l  k 


(3.29) 


(3.30) 


2  1 

arises  as  a  coefficient  of  o  (Ag  )  in  contributing  to  (3.18).  An  estimation 
procedure  similar  to  that  used  in  arriving  at  (3.39)  shows  that  (3.30)  can  be 
approximated  by 


2tt  ik  ] 


k  (X|2irkr+1) 


x3M  c 

2  _  X  C4 


(3.31) 


<7 
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111  3  8 

with  an  error  0 Hr  if  X  (n)n  +  «  as  n  -►  «.  The  estimates  obtained 
for  ( 3 * 28)  and  (3.30)  imply  that  the  contribution  to  (3.18)  from  the 
terms  involving  Ag^  and  Ag1  in  (3.16)  is 


0 


3  8 

if  X  (n)n  -*■  «  as  n  +  ».  Now  consider  the  contribution  from  the  last 
term  on  the  right  of  (3.16).  We  shall  see  that  it  makes  the  major  contri¬ 
bution  to  the  integrated  variance.  The  expression 


n-1 


|b 


k+sn 1 


X+r. 


1 

n 


(3.32) 


can  be  approximated  by 


T  _ i _ 

0<|k|<§  <l2irkl4+1> 


_  I  c* 
2  n  “ 


X‘1/4 


(3.33) 


where 


/« 


dx 


2ttx|  ^+1)^ 


with  an  error  0 


if  X(n)n  »  as  n  ■+*  ».  Theorem  1  follows 


from  these  estimates. 


Our  next  object  is  to  derive  Theorem  2  for  the  integrated  squared  bias 


of  g  as  an  estimate  of  f.  Notice  that  for  k  t  0  we  have 
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-f 


fk  =  J  e2l,lktf(t)dt  -  Af °ak  -  Af1!^  + 


(3.34) 


with 


\ 


t 


e2TTiktf^t)dt  . 


(3.35) 


Using  (3.34)  it  is  clear  tha* 


»  •  E 


n-1 


f  (f(o)+f (j»  »  JLj  f  =  J]  f 

k=-«  k  k=0  k 


(n) 


-  E 


i(j/n)  =  f.  exp  (-2wijk/n) 

j=-°° 


(3.36) 


n-1 

^  exp  (-2itljk/n) ,  j=l,...,n-l  . 

k=0 


This  implies  that 


E  yj/v^T  =  f jn) ,  J  -  0,1,..., n-1 


(3.37) 


From  (3.34)  it  follows  that 


c (n)  _  Ar0  (n)  Airl,  (n)  .  /  ,  N(n) 

fj  *  Af  a^  -  Af  bj  +  t  j  *  n-1 


Relations  (3.9),  (3.10),  and  (3.37)  imply  that 


.  !•£'->*<«  ml 


* 

% 


I 


E  a  -f 

sk+sn  k+sn 
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0  Af0N  (. 


k+sn  1 


(n)  | 


(E  Ag  "Af  >  jak+sn  "  ~TVrk  '  3k  ) 

-  (E  Ag^Af1)  jb  -  — ^sn—  b^n)i 
|  k+sn  A+r^  k  1 


•  /_  l  \ (n)  1  k+sn 

h^c  k  A+r,  mk+sn  k+sn 

k 


(3.44) 


Further,  if 


f  €  C 


we  have 


\  =  Af2ak  -  Af\  + 


(3.45) 


with 


Af2  =  f(2)(l)  -  f(2)(0)  , 
Af3  =  f(3)(l)  -  f(3)(0)  , 


;(4) 


■f 


exp  (27rikt)f  ^  (t)dt  . 


The  last  term  on  the  right  of  (3.44)  can  then  be  rewritten  as 


. r ^  I  /  .  v(n)  I bk+sn I  . 

|  k  k  ^+rk  &k+sn  k+sn | 

2  v 

-&f3j(b2)(n)  'V-gpi  -  b2 

)  k  X+rk  k+sn  J 

,  L(4>,2  (n)  'bk+sJ2  f(4)  ,2 
+  j(fk  V  X+rk  "  fk+2nVsn 


(3.46) 
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/ 


Cn(t)  =  — -  /  - — 7 -  dx  =  — e  ^2  (cos(t2  b  +  sin(  1 1 1  2— ^ ) ) 
'  1+x  2/2 


Successive  differentiation  then  indicates  that 


C^t) 


c2(t) 


c3(t) 


*/ 

*/ 

*/ 


e*tXix  j  1  1 1 1 2  ^ 

- 7—  dx  =  -  7  e  1  sin(t2  )  , 

1+x 

^i^0ldx  =_l_e-M2'i  (8in(|t|2-*)-coS  (t2“*))  , 
1+x4 

dx-isgnte-l^cos  (t2“*)  . 

1+x4  2 


An  application  of  the  Poisson  summation  formula  tells  us  that 


B  .  (t)  =  X 


3-j 


c^ak-tn  >  . 


(3 


Only  the  terms  in  the  sum  (3.48)  corresponding  to  k  =  0,  k  =  1  need  to 


be  considered  since  the  sum  of  the  remaining  terms  die  off  at  the  rate 


-aX 


with  a  a  positive  constant.  Notice  that  the  formulas  for  the 


Cj(t)  above  imply  that 


C  =  C  =  - 

1  3  2/2 


4  2 

Lemma  5.  Assume  that  f  €  C  .  Then  if  Af  i4  0 


E  Ag°-Af°  ^  -  X1^2Af 2 


while  if  Af  =0 


.48) 


(3.49) 
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E  Ag°-Af°  St  lH  x3/4  (f(3)(0)  +  f(3)(l)}. 

If  f(2)(0)  +  f(2)(l)  *  0  we  have 

E  Ag3-Af 1  fit  -  2«£  X1/4  (f(2)(0)  +  f (2)  (1) } 

and  if  f(2)(0)  +  f(2)(l)  =  0 


AgX-Af 1  fit  Af3X1/2 


as  A  =  A(n)  -*  0. 


The  asymptotic  relations  (3*49)  and  (3.50)  follow  from  (3.40), 
and  (3.45).  Formula  (3.51)  is  a  consequence  of  (3.41)  and  (3.42). 
f^2\o)  +  f^(l)  ■  0,  since  l  Re  =  j  (f^(0)  +  f^(l))  one 


see  that 


Re  \ 
1+X(2nk)4 


-E 


X  (  27tk)  H  Re 
1+A(27rk)^ 


However  by  (3.45) 


Af~ 


(2Trk)' 


(f^(x)  +  f  ^  (-x) )  cos  27rkx  dx 


This  implies  (3.52). 

Lemma  6 .  Let  f  €  C  •  If  f^  ^  (0)  ^  0,  f^  (1)  =  0  then 


(3.50) 


(3.51) 


(3.52) 

(3.43) 

If 

can 

(3.53) 


(3.54) 


,T  ■'  ^  ^ 
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E  g(t)-f(t)  -  f(2)(0)  A^e”t2  4  [sin(t2~*X_i)  -  cos(t2-iX-i) ] 


+  e(t)  , 


(3.55) 


0  <  t  <  1,  where  the  error  term  e(t)  is  such  that 


/■ 


/■ 


e(t)2  dt  *  o(  /  [E  g(t)-f(t>]2  dt 


(3.56) 


If  f(2)(0)  =  f(2)(l)  =  0,  f(3)(0)  +  0,  f(3)(l)  =  0,  we  have 


“ii'i 


E  g(t)-f(t)  =  f(3)(0)X3/4  Jl  e't2  X  cos(t2"iX"i)  +  e(t)  (3.57) 


0  <  t  <  1,  where  the  error  term  again  satisfies  (3.56).  The  approximations 
appropriate  for  the  oases  f^(0)  *  0,  f^23(l)  f  0  and  f^(0)  = 
f^(l)  =  0,  f^(0)  =  0,  f^(l)  t  0  ^re  obtained  by  replacing  t  by 
1-t  in  the  main  expressions  on  the  right  of  (3.55)  and  (3.57)  respectively . 

We  next  consider  the  variance  and  bias  of  the  derivative  g"  of  the 
smoothing  spline*  Theorems  2  and  3  follow  from  the  previous  analysis  of  g, 
after  noting  that  the  Fourier  coefficients  of  are 


go =  Ag 


(3.58) 


gk  “  akAg  "  V\,  k  t  0  . 


(3.59) 


We  first  consider  the  integrated  squared  variance 


V  =  £°2(gk>  • 
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From  0.26), 


°2(go)  -  IT  c2ci2x‘1/4 


As  in  (3.16) 


3k+snAg  "  ak+snhk+sn  "  ak+snAg  (*  -  A+r  bk+snbk 


,  * 

_l  _  (n)A  0  ak+sn  k+sn  ^k 

+  ak+snak  Ag - 


(3.60) 


Estimates  similar  to  those  used  in  the  analysis  of  the  smoothing  spline 

show  that  the  contribution  to  the  variance  from  the  first  term  is  of  order 

-1  -1 
X  n  .  The  second  term  gives  a  contribution  of  order  X  n  ;  the  third 

term  dominates,  giving  a  total  contribution  to  V 


n 


[2rrikr 


[A(2irik)4+1]2 


=  —  A 
n 


-3/4 


L 


(2ttx)' 


»  ((2ttx)Vi)^ 


dx 


(3.61) 


Next,  the  bias: 


E  8k+sn  -  fk+sn  =  ak+sn(E  Agl  "  Afl) 


^^sn^  *V+sn  ^-l-sn^ 


(3.62) 


which,  as  in  (3.44) 
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Making  approximations  as  in  the  analysis  of  the  spline  function  itself, 

E  g'(t)-f'(t)  3*  (E  Ag°-Af°)  +  (E  Ag°-Af°)  X-1  BQ(t) 

+  (E  Ag^Af1)  B3(t)  +  Af2B2(t)  -  Af3Bx(t)  . 

(2) 

Using  the  Poisson- summation  approximation  and  Lemma  5  if  fv  (0)  ±  0, 


E  g'(t)  -  f'(t)  S  f(2)(0)  2*  X*  e“U  cos  u 
where  u=  2_i  X_it.  If  f(2)(0)  =  f(2)(l)  =  0,  and  f(3)(0)*0,  f(3)(l)=0 

E  g"(t)  -  f"(t)  “  -  f(3)(0)  X*  e  U  (sin  u  +  cos  u)  . 

Note  that  the  approximate  (in  an  sense)  bias  of  the  derivative  is 

the  derivative  of  the  approximate  bias  (Lemma  6). 
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4.  Deconvolution 

We  now  sketch  the  development  of  the  deconvolution  results.  Since 
this  parallels  closely  the  derivations  of  Section  3  the  presentation  will 
be  somewhat  sketchier.  As  before  let  g  have  Fourier  coefficients 

gk  *  Ag°ak  -  AgH>k  +  \\  k  ^  0  (4.1) 


and  let 


\  =  wk  \ 

\  =  wk  bk 
\  =  wk  bk  \ 


and  define 


as  in  Section  3.  Then  (1.7)  may  be  written  as 


(A. 2) 


Minimizing  the  0tb  term  gives  hgn  *  0,  s  #  0,  and  +  Gq11^  =  Yq 

As  in  the  analysis  of  Section  3,  we  first  fix  Ag^  and  Ag^  and 
minimize  with  respect  to  the  h^'s.  If 
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_  Ll  *  .  (n)  .  lR(n) 

z,  *  — *■  -  Ag  A.  +  ig  B, 


*  & 


Then  (4.2)  becomes 


E'  ivHj(n)|2  +  x[Ugl)2  +2T  i\ 


t  2  I B  1-2 

;+sn*  1  k+sn 


i 


(4.3) 


The  minimizing  coefficients  can  be  calculated  to  be 


h  =  B 
j+sn  j+sn  A+p 


(4.4) 


2  0  1 
where  p  =  Y  Ib^,  I  .  Now  to  calculate  the  minimizing  Ag  and  Ag  , 

i  sJic  J+sn 

this  solution  is  substituted  back  into  (4.3)  to  give 


X  I  Zj  | 2  CX+Pj)-1  +  X(AgV  . 

Minimizing  this  with  respect  to  Ag®  and  Ag1  amounts  to  solving  two 
linear  equations  in  two  unknowns,  and  it  may  be  seen  that  the  solution 
is  approximately 


(4.5) 


(4.6) 


Ag1  =*• 


-  j  Af1  +  Re£  ^  B^n)  (X+Pj)"1!  jl  +  £  lBj(n)!2  <X+Pj>_1| 


(4.7) 


We  next  consider  the  integrated  squared  variance,  which  is  the  sum  of 


the  variance  of  the  Fourier  coefficients  of  g.  Now,  from  above. 


j+sn 


°  r 

Ag  a 


j+sn 


_  1+sn 1 


|2w  A(n)l 

It?"  3 

hPJ  J 


4 


|b  !2wi+  B.(n) 

+  Ag"  lb,  _  ‘.j±snl  j+sn  J 
6  1  j+sn  X+Pj 


] 


,  "w  h 

x+pj  ^  ' 


(4.8) 


Via  approximations  similar  to  those  in  Section  3,  it  may  be  seen  that  the 
first  two  terms  contribute  a  net  variance  of  order  n  a  v  7  whereas 
the  third  term  contributes  the  dominating  variance,  which  is  of  order 

-l.-(2B+l)/(2B+4) 

n  A  * 

If  we  write  the  Fourier  coefficients  of  f  as 

fk  =  Af°ak  "  Af\  +  (4.9) 


and  take  expectations  in  (4,8),  the  bias  of  the  (j+sn)*'*1  Fourier  coefficient 
may  be  expressed  as 


E  W'kwn  '  <E  48°-af°)  [' 

-  (E  4g1-4f1)  Tl 


k+sn 


X+Pu 


] 


k+sn 


X+Pu 


] 


,  lbk+8nl2wk+snMkn) 

X+p.  "Sc+sn  k+sn 


(4.10) 


As  in  section  3  for  |k|  <  n/2,  k  i*  0 


E  g  -f  3  (E  Sg°-lf°)  — ^ 

"  k  »+|Bj2 


1  1  Xbk 

-  (E  Ag-Af  )  - S_ 

x+iBvr 


+  Af 


2  Xakbk 


X+  B, 


-  Af 


3  Xbk 


*+K 


If  we  let 


Xfk4>bil 

k  k 
X+|B.  |2 


D,(t)  =  X  T  exp  (-2irikt)  j*  0,1, 2, 3 

j  k  x+|b,  I2 


(note  that  IlD^ ||2  ~  x(7-2j)/(4+2 6) )  then 

E  g(t)-f(t)  S*  (E  Ag°-Af°)D3(t)  -  (E  AgJ-Af 1)D2(t) 


+  Af2D1(t)  -  Af3DQ(t) 


f<4)  b2 

+  X^^  - 2  exP  (~2uikt) 


A+|B- 
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The  function  Dj(t)  play  the  role  of  the  functions  B^(t)  °f  section  3. 
Although  their  exact  analytical  forms  depend  on  w,  they  are,  like  the 


Bj's,  successively  odd  and  even,  and  are  increasingly  peaked  near  0 
and  1  as  X  ■*  0. 


We  now  consider  the  individual  terms  in  (4.13).  From  (4.6)  it 
follows  that 


,  .  0  ,,o  1  "kBkAk<Ml,k> 

E  Ag  -Af  = 


-1 


l  |AV.|2CA+Pk)_1 


The  denominator  can  be  estimated  to  be  ~  x^/(26+4)^  jf  Af  jt  0,  the 


numerator  is 


*4£2El»kl2(»pk)'l~rw'«+4'  . 


In  combination  with  this  gives  a  net  contribution  to  the  integrated 

squared  bias  which  is  ~  ^5/ (23+4)^  ^  Af^  _  q  the  numerator  is 
a*  (f(^)(l)  +  f^^(0))/2,  giving  a  net  contribution  of  order  xJ  ^  \ 

If  f^^(0)  *  f^(l)  -  0  k  *  2,3  the  net  contribution  is  O(A^). 

Next, 


E  Ag1-Af 1 


Af  *  +  £  m1  |B1  |2(^4pk)' 
l  +  l  I  Bj  |  2(^+Pk>_1 


The  denominator  is  ~  X  ^  and  if  f^  ^(1)  or  ^(0)  /  0  the 

numerator  is  (f^2\l)  +  f^(0))/7..  This  gives  a  net  contribution  to 
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the  integrated  squared  bias  of  order  ^5/( 28+4)^  both  second  derivatives 
are  zero  the  numerator  is 


a1/(26+A) 


giving  a  net  contribution  of  order  ^7/ (28+4) ^  ^  both  second  and  third 

2 

derivatives  vanish  at  1  and  0  the  net  contribution  is  0(X  ). 

The  last  term  in  (4.11)  can  be  estimated  and  makes  a  contribution  to 

2 

the  integrated  squared  bias  of  order  X  . 


,  *  *  ' 
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